Sequence Features and Subset Selection Technique for the Prediction of Protein Trafficking Phenomenon in Eukaryotic Non Membrane Proteins

نویسندگان

  • Geetha Govindan
  • Achuthsankar S Nair
چکیده

Protein trafficking or protein sorting is the mechanism by which a cell transports proteins to the appropriate position in the cell or outside of it. This targeting is based on the information contained in the protein. Many methods predict the subcellular location of proteins in eukaryotes from the sequence information. However, most of these methods use a flat structure to perform prediction. In this work, we introduce ensemble methods to predict locations in the eukaryotic protein-sorting non membrane pathway hierarchically. We used features that were extracted exclusively from full length protein sequences with feature subset selection for classification. Sequence driven features, sequence mapped features and sequence autocorrelation features were tested with ensemble learners and classifier performances were compared with and without feature subset selection technique.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Rab11 in Disease Progression

Membrane/ protein trafficking in the secretory/ biosynthetic and endocytic pathways is mediated by vesicles. Vesicle trafficking in eukaryotes is regulated by a class of small monomeric GTPases the Rab protein family. Rab proteins represent the largest branch of the Ras superfamily GTPases, and have been concerned in a variety of intracellular vesicle trafficking and different intracellular sig...

متن کامل

Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks

Background: Prediction of the protein localization is among the most important issues in the bioinformatics that is used for the prediction of the proteins in the cells and organelles such as mitochondria. In this study, several machine learning algorithms are applied for the prediction of the intracellular protein locations. These algorithms use the features extracted from pro...

متن کامل

A Novel Vector for Expression/Secretion of Properly Folded Eukaryotic Proteins: a Comparative Study on Cytoplasmic and Periplasmic Expression of Human Epidermal Growth Factor in E. coli

Expression of eukaryotic proteins in E. coli often results in their aggregation. Proper folding and solubility of therapeutical proteins are the pre-requisite for their bioactivity. This is not achieved in cytoplasmic expression in E. coli because of the absence of disulfide bonds formation. A novel expression/secretion vector was constructed which exploited β-lactamase signal sequence to trans...

متن کامل

Phylogenetic Analysis of Three Long Non-coding RNA Genes: AK082072, AK043754 and AK082467

Now, it is clear that protein is just one of the most functional products produced by the eukaryotic genome. Indeed, a major part of the human genome is transcribed to non-coding sequences than to the coding sequence of the protein. In this study, we selected three long non-coding RNAs namely AK082072, AK043754 and AK082467 which show brain expression and local region conservation among vertebr...

متن کامل

Syntaxin 1 is expressed in the trout saccular hair cells: RT-PCR and immunocytochemical observations

Syntaxin is one of several proteins that may be involved in the docking of synaptic vesicles, synaptic vesicle recycling, and non-synaptic membrane trafficking. Presence of syntaxin has been reported in rat auditory and vestibular end organs. In the current study, we have examined the expression of message for syntaxin 1 in hair cells of the sacculus of the rainbow trout, Oncorhynchus mykiss, w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015